A Unified Model for Unsupervised Opinion Spamming Detection Incorporating Text Generality

نویسندگان

  • Yinqing Xu
  • Bei Shi
  • Wentao Tian
  • Wai Lam
چکیده

Many existing methods on review spam detection considering text content merely utilize simple text features such as content similarity. We explore a novel idea of exploiting text generality for improving spam detection. Besides, apart from the task of review spam detection, although there have also been some works on identifying the review spammers (users) and the manipulated offerings (items), no previous works have attempted to solve these three tasks in a unified model. We have proposed a unified probabilistic graphical model to detect the suspicious review spams, the review spammers and the manipulated offerings in an unsupervised manner. Experimental results on three review corpora including Amazon, Yelp and TripAdvisor have demonstrated the superiority of our proposed model compared with the state-of-the-art models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptable Text Filters and Unsupervised Neural Classifiers for Spam Detection

Spam detection has become a necessity for successful email communications, security and convenience. This paper describes a learning process where the text of incoming emails is analysed and filtered based on the salient features identified. The method described has promising results and at the same time significantly better performance than other statistical and probabilistic methods. The sali...

متن کامل

Topic-Level Opinion Influence Model(TOIM): An Investigation Using Tencent Micro-Blogging

Text mining has been widely used in multiple types of user-generated data to infer user opinion, but its application to microblogging turns out to be difficult, since text messages are short and noisy, providing limited information about user opinion. Given that microblogging users communicate each other to form a social network, we hypothesize that user opinion is influenced by its neighbors i...

متن کامل

An approach for detecting spam in arabic opinion reviews

For the rapidly increasing amount of information available on the Internet, little quality control exists, especially over the user-generated content. Manually scanning through large amounts of user-generated content is time-consuming and sometime impossible. In this case, opinion mining is a better alternative. Although, it is recognized that the opinion reviews contain valuable information fo...

متن کامل

Detecting Deceptive Opinion Spam using Linguistics, Behavioral and Statistical Modeling

With the advent of Web 2.0, consumer reviews have become an important resource for public opinion that influence our decisions over an extremely wide spectrum of daily and professional activities: e.g., where to eat, where to stay, which products to purchase, which doctors to see, which books to read, which universities to attend, and so on. Positive/negative reviews directly translate to finan...

متن کامل

Towards Accurate Deceptive Opinion Spam Detection based on Word Order-preserving CNN

As a mainly network of Internet naval activities, the deceptive opinion spam is of great harm. The identification of deceptive opinion spam is of great importance because of the rapid and dramatic development of Internet. The effective distinguish between positive and deceptive opinion plays an important role in maintaining and improving the Internet environment. Deceptive opinion spam is very ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015